Search results for "discretized learning"

Showing 3 of 3 documents

On incorporating the paradigms of discretization and Bayesian estimation to create a new family of pursuit learning automata

2013

Published version of an article in the journal Applied Intelligence. Also available from the publisher at: http://dx.doi.org/10.1007/s10489-013-0424-x

Two fundamental paradigms are currently used to enhance the convergence speed of Learning Automata (LA). The first uses estimates of the reward probabilities, while the second discretizes the probability space in which the LA operates. This paper demonstrates how both can be used simultaneously, in particular by using the family of Bayesian estimates that have been proven to have distinct advantages over their maximum likelihood counterparts. The success of LA-…

Bayes estimator; Learning automata; Discretization; business.industry; Computer science; Maximum likelihood; Bayesian probability; estimator algorithms; Bayesian reasoning; Estimator; Bayesian inference; discretized learning; VDP::Mathematics and natural science: 400::Information and communication science: 420::Knowledge based systems: 425; Artificial intelligence; ε-optimality; pursuit schemes; business; Algorithm

A novel strategy for solving the stochastic point location problem using a hierarchical searching scheme

2014

Stochastic point location (SPL) deals with the problem of a learning mechanism (LM) determining the optimal point on the line when the only inputs it receives are stochastic signals about the direction in which it should move. The SPL differs from the traditional class of optimization problems in that the directional information, for example as inferred from an Oracle (which possibly computes the derivatives), suffices to achieve the optimization, without explicitly computing any derivatives. The SPL can be described in terms of an LM (algorithm) attempting to locate a point on a line. The LM interacts with a random environme…

Continuous-time stochastic process; Mathematical optimization; Optimization problem; Controlled random walk; Time reversibility; Discretized learning; 02 engineering and technology; Learning automata; Stochastic-point problem; 0202 electrical engineering electronic engineering information engineering; Electrical and Electronic Engineering; Stochastic neural network; Mathematics; Binary tree; 020206 networking & telecommunications; Random walk; Computer Science Applications; Human-Computer Interaction; Control and Systems Engineering; 020201 artificial intelligence & image processing; Stochastic optimization; Software; Information Systems

Discretized Bayesian Pursuit – A New Scheme for Reinforcement Learning

2012

Published version of a chapter in the book Advanced Research in Applied Artificial Intelligence. Also available from the publisher at: http://dx.doi.org/10.1007/978-3-642-31087-4_79

The success of Learning Automata (LA)-based estimator algorithms over the classical, Linear Reward-Inaction (L_RI)-like schemes can be explained by their ability to pursue the actions with the highest reward probability estimates. Without access to reward probability estimates, it makes sense for schemes like the L_RI to first make large exploring steps, and then to gradually turn exploration into exploitation by making progressively smaller learning steps. However, this behavior becomes counter-intuitive wh…

Scheme (programming language); Mathematical optimization; Discretization; Learning automata; Computer science; business.industry; VDP::Mathematics and natural science: 400::Information and communication science: 420::Algorithms and computability theory: 422; estimator algorithms; Bayesian probability; Bayesian reasoning; Estimator; VDP::Technology: 500::Information and communication technology: 550; discretized learning; Bayesian inference; Action (physics); Reinforcement learning; Artificial intelligence; pursuit schemes; business; computer; computer.programming_language